semi-supervised segmentation
Semi-Supervised Medical Image Segmentation via Knowledge Mining from Large Models
Mao, Yuchen, Li, Hongwei, Lai, Yinyi, Papanastasiou, Giorgos, Qi, Peng, Yang, Yunjie, Wang, Chengjia
Large-scale vision models like SAM have extensive visual knowledge, yet their general nature and computational demands limit their use in specialized tasks like medical image segmentation. In contrast, task-specific models such as U-Net++ often underperform due to sparse labeled data. This study introduces a strategic knowledge mining method that leverages SAM's broad understanding to boost the performance of small, locally hosted deep learning models. In our approach, we trained a U-Net++ model on a limited labeled dataset and extend its capabilities by converting SAM's output infered on unlabeled images into prompts. This process not only harnesses SAM's generalized visual knowledge but also iteratively improves SAM's prediction to cater specialized medical segmentation tasks via U-Net++. The mined knowledge, serving as "pseudo labels", enriches the training dataset, enabling the fine-tuning of the local network. Applied to the Kvasir SEG and COVID-QU-Ex datasets which consist of gastrointestinal polyp and lung X-ray images respectively, our proposed method consistently enhanced the segmentation performance on Dice by 3% and 1% respectively over the baseline U-Net++ model, when the same amount of labelled data were used during training (75% and 50% of labelled data). Remarkably, our proposed method surpassed the baseline U-Net++ model even when the latter was trained exclusively on labeled data (100% of labelled data). These results underscore the potential of knowledge mining to overcome data limitations in specialized models by leveraging the broad, albeit general, knowledge of large-scale models like SAM, all while maintaining operational efficiency essential for clinical applications.
Review for NeurIPS paper: Few-Cost Salient Object Detection with Adversarial-Paced Learning
Some related works missing There are some recent related works, such as [Ref. 1 Ref.6]. In [Ref. 1, 2], the authors integrated the self-paced learning into the object co-saliency detection related to the addressed task of this paper. These two are close to the proposed work, and it is better to provide the discussion. In [Ref 5, and Ref. 6], the authors also combine self-paced learning and adversarial learning, and I think these two works are mostly related to the proposed method, and I would like to see the difference between the proposed method and [Ref 5 and 6] Besides, in [3, 23, 27], semi-supervised learning for saliency detection is addressed, but in this paper, there is no detailed discussion between semi-supervised learning [23, 27] and the proposed few-cost setting. The primary difference should be provided.
Semi-Supervised Segmentation of Functional Tissue Units at the Cellular Level
Sydorskyi, Volodymyr, Krashenyi, Igor, Sakva, Denis, Zarichkovyi, Oleksandr
We present a new method for functional tissue unit segmentation at the cellular level, which utilizes the latest deep learning semantic segmentation approaches together with domain adaptation and semi-supervised learning techniques. This approach allows for minimizing the domain gap, class imbalance, and captures settings influence between HPA and HubMAP datasets. The presented approach achieves comparable with state-of-the-art-result in functional tissue unit segmentation at the cellular level.
Semi-supervised segmentation of tooth from 3D Scanned Dental Arches
Alsheghri, Ammar, Ghadiri, Farnoosh, Zhang, Ying, Lessard, Olivier, Keren, Julia, Cheriet, Farida, Guibault, Francois
Teeth segmentation is an important topic in dental restorations that is essential for crown generation, diagnosis, and treatment planning. In the dental field, the variability of input data is high and there are no publicly available 3D dental arch datasets. Although there has been improvement in the field provided by recent deep learning architectures on 3D data, there still exists some problems such as properly identifying missing teeth in an arch. We propose to use spectral clustering as a self-supervisory signal to joint-train neural networks for segmentation of 3D arches. Our approach is motivated by the observation that K-means clustering provides cues to capture margin lines related to human perception. The main idea is to automatically generate training data by decomposing unlabeled 3D arches into segments relying solely on geometric information. The network is then trained using a joint loss that combines a supervised loss of annotated input and a self-supervised loss of non-labeled input. Our collected data has a variety of arches including arches with missing teeth. Our experimental results show improvement over the fully supervised state-of-the-art MeshSegNet when using semi-supervised learning. Finally, we contribute code and a dataset.